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SUMMARY 


A  human  basal  ganglia  cDNA  library  was  screened  with 
oligonucleotide  probes  corresponding  to  portions  of  the  amino 
acid  sequence  of  human  serum  cholinesterase  (EC  3.1. 1.8).  Five 
overlapping  clones,  representing  2.4  kilobases  (kb),  were 
isolated.  The  sequenced  cDNA  contained  212  base  pairs  (bp)  of 
coding  sequence  5'  to  the  N-terminus  of  the  mature  protein  in 
which  there  were  four  ATG  translation  start  sites  in  the  same 
reading  frame  as  the  protein.  Only  the  ATG  coding  for  Met  -28 
lay  within  a  favorable  consensus  sequence  for  functional 
initiators.  There  were  1722  bp  of  coding  sequence  corresponding 
to  the  soluble,  globular,  G4  form  of  cholinesterase.  The  amino 
acid  sequence  deduced  from  the  cDNA  exactly  matched  the  574  amino 
acid  sequence  of  human  serum  cholinesterase,  which  had  previously 
been  determined  by  Edman  degradation.  Therefore,  our  clones 
represented  cholinesterase  (EC  3. 1.1. 8)  rather  than 
acetylcholinesterase  (EC  3. 1.1.7).  It  was  concluded  that  the 
amino  acid  sequence  of  cholinesterase  from  two  different  tissues, 
human  brain  and  human  serum,  was  identical.  Expression  of  a 
recombinant  plasmid  in  bacterial  cells  yielded  cholinesterase 
protein  but  no  cholinesterase  activity. 

Hybridization  of  genomic  DNA  blots  suggested  that  a  single 
gene,  or  very  few  genes,  coded  for  cholinesterase.  Multiple 
mRNAs  of  different  lengths  were  found  on  Northern  blot  analysis 
of  human  liver  and  brain.  A  few  of  these  mRNA  sizes  may  be 
explained  by  transcription  termination  at  multiple  poly(A) 
sites.  One  genomic  clone  was  isolated  from  a  human  genomic 
library  by  screening  with  cholinesterase  cDNA.  The  clone 
contained  exons  1  and  2.  We  are  presently  searching  for  a  genomic 
clone  containing  exons  3  and  4.  The  minimum  length  of  the 
cholinesterase  gene  is  17  kb. 


FOREWORD 


Citations  of  commercial  organizations  and  trade  names  in  this 
report  do  not  constitute  an  official  Department  of  the  Army 
endorsement  or  approval  of  the  products  or  services  of  these 
organizations . 

The  investigator ( s)  have  abided  by  the  National  Institutes  of 
Health  Guidelines  for  Research  Involving  Recombinant  DNA 
Molecules  (April  1982)  and  the  Administrative  Practices 
Supplements . 

For  the  protection  of  human  subjects,  the  investigator(s) 
have  adhered  to  policies  of  applicable  Federal  Law  45CFR46. 
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INTRODUCTION 


In  our  Annual  Report  for  1986  (1),  we  reported  isolation  and 
sequencing  of  one  cDNA  clone  coding  for  a  portion  of  human 
cholinesterase.  This  clone  contained  a  total  of  1440  bases,  of 
which  1138  coded  for  phenylalanine  195  through  leucine  574,  while 
302  bases  were  in  the  3'  untranslated  region.  From  amino  acid 
sequencing  (2)  we  knew  the  cholinesterase  subunit  contained  a 
total  574  amino  acids  and  that  therefore  our  first  clone 
represented  only  66%  of  the  mature  cholinesterase  protein,  in 
the  present  report  we  have  isolated  overlapping  cDNA  clones  from 
a  human  brain  cDNA  library  and  have  completed  sequencing 
cholinesterase  cDNA  (3). 

Cholinesterase ( EC  3. 1.1. 8)  and  acetylcholinesterase  (EC 
3. 1.1. 7)  are  generally  considered  to  be  products  of  different 
genes  (4).  The  evidence  includes  the  observation  that  there  is 
no  immunological  cross-reaction  between  them  (5,6).  Furthermore, 
there  are  known  genetic  variants  of  human  cholinesterase, 
including  the  silent  variant  with  zero  activity  in  serum  (7),  but 
no  corresponding  genetic  variants  of  acetylcholinesterase.  The 
function  of  cholinesterase  is  unknown,  while  the  function  of 
acetylcholinesterase  is  to  terminate  nerve  impulse  transmission 
at  cholinergic  synapses.  Cholinesterase  and  acetylcholinesterase 
are  highly  similar,  but  not  identical,  with  regard  to  substrate 
preference,  the  types  of  chemicals  that  inhibit  activity,  tissue 
distribution,  and  occurrence  as  globular  and  asymmetric  molecular 
forms  (4).  Human  cholinesterase  and  Torpedo  acetylcholinesterase 
are  54%  identical  in  amino  acid  sequence  ( 2 ) .  They  have  nearly 
the  same  number  of  amino  acids  per  catalytic  subunit,  574  and  575 
respectively.  Their  differences  in  molecular  weights  are  due  to 
differences  in  glycosylation,  acetylcholinesterase  having  four 
and  cholinesterase  having  nine  carbohydrate  chains  per  subunit 
(2,8).  Their  disulfide  bonds  are  located  in  precisely  the  same 
locations,  suggesting  similar  protein  folding  (9,10).  The 
present  report  shows  the  complete  nucleotide  sequence  of  human 
cholinesterase  cDNA  and  the  partial  sequence  of  a  genomic  clone. 

This  work  is  of  interest  to  the  Army  because  of  its  potential 
application  to  defense  against  nerve  gas.  The  cholinesterases 
are  the  primary  targets  of  nerve  gas.  The  cholinesterase  in 
serum  acts  as  a  scavenger  of  nerve  gas,  reacting  rapidly  to 
detoxify  it.  Our  present  results  will  be  the  basis  for 
expression  of  cholinesterase  by  recombinant  DNA  techniques.  This 
may  lead  to  production  of  a  small  cholinesterase-like  molecule 
that  can  detoxify  nerve  gas. 


METHODS 


cDNA  library  screening.  A  human  cDNA  library  from  the  basal 
ganglia  of  a  1-day-old  brain  was  kindly  provided  by  Dr.  Robert  A. 
Lazzarini  (11).  This  same  library  is  now  available  from  the 
American  Type  Culture  Collection,  Rockville,  MD.  The  cDNA  was 
cloned  into  lambda  gtll  via  Eco  RI  linkers.  The  library  contains 
approximately  1  million  independent  recombinants.  Screening  was 
carried  out  by  the  iji  situ  plaque  hybridization  method  of  Benton 
and  Davis  (12)  as  modified  by  Maniatis  et  al.  (13).  Table  1 
lists  the  oligonucleotide  probes  used  for  finding  the  first  three 
positive  clones,  which  are  called  OH57,  Z35,  and  Z3  in  Figure  1. 
Design  of  the  oligonucleotide  probes  was  based  on  the  previously 
obtained  amino  acid  sequence  of  human  serum  cholinesterase  (2). 
Oligonucleotides  5  and  6  were  synthesized  by  the  University  of 
Michigan  DNA-synthesizing  facility.  All  other  oligonucleotides 
were  synthesized  by  Pharmacia  PL-Biochemical  Corp,,  Milwaukee, 
WI.  The  oligonucleotides  were  labeled  with  gamma- i  ZP ] -ATP  using 
T4  polynucleotide  kinase.  Prehybridization  as  well  as 
hybridization  was  in  0.25%  nonfat  dry  milk  (14)  dissolved  in  6  x 
SSC  (6  x  SSC  is  0.9  M  NaCl  and  0.09  M  Na  Citrate,  pH  7.0). 
Hybridization  was  for  12-20  h  at  36°C  for  all  oligonucleotides 
except  No.  5  which  was  hybridized  at  46°C.  Posthybridization 
washes  were  at  room  temperature  in  6  x  SSC  and  0.05% 
pyrophosphate,  followed  by  a  10-min  wash  at  41-43°C  in  6  x  SSC 
and  0.05%  pyrophosphate  for  probes  3,  4,  7,  8,  9,  and  10.  More 
stringent  conditions  were  used  for  the  final  wash  for  probes  5 
and  6.  Probe  5  was  washed  for  10  min  at  46°C  in  2  x  SSC  and  0.1% 
sodium  dod-cyl  sulfate  (SDS).  Probe  6  was  washed  for  10  min  at 
46°C  in  6  x  SSC  and  0.05%  pyrophosphate. 

Additional  details  of  isolation  and  sequencing  of  clone  Z3 
are  in  our  first  Annual  Report  (1).  A  130-base  pair  (bp) 
fragment  of  clone  Z3  was  used  as  a  probe  to  find  the  overlapping 
clone  Z2.  The  130-bp  fragment  had  been  produced  by  digestion  with 
Sau  3A1.  It  was  subcloned  into  phage  Ml 3  where  it  was 
radiolabeled  by  synthesizing  a  labeled  complement  strand.  The 
double-stranded  fragment  was  released  from  M13  by  digestion  with 
Bam  HI  and  Eco  RI.  It  was  purified  by  polyacrylamide  gel 
electrophoresis  (PAGE)  and  electroeluted  into  a  dialysis  bag. 
Ong  hundred  thousand  plaques  were  screened  by  hybridization  at 
60°C,  followed  by  three  brief  washes  at  room  temperature  in  2  x 
SSC  and  0.1%  SDS,  and  a  10-min  wash  at  59°C  in  0.1  x  SSC  and  0.2% 
SDS . 


A  270-bp  fragment  of  clone  Z2  was  used  as  a  probe  to  find 
overlapping  clone  Z13.  The  270-bp  fragment  was  produced  by 
digesting  clone  Z2  with  Bam  HI  and  Eco  RI .  The  Eco  RI  site  was  in 
the  linker  joining  human  DNA  to  the  lambda  arm  and  was  not  in  the 
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cholinesterase  cDNA.  The  270-bp  fragment  was  amplified  in 
plasmid  pUC18.  The  270-bp  fragment  was  isolated  on  low  melting 
agarose  and  radiolabeled  by  the  random  oligolabeling  method  of 
Feinberg  and  Vogelstein  (15).  Hybridization  was  at  68°C. 
Posthybridization  washes  were  with  0.1  x  SSC  and  0.5%  SDS.  Three 
brief  washes  at  room  temperature  were  followed  by  a  120-min  wash 
at  68°C . 

DNA  sequencing.  Sequencing  was  by  the  dideoxy  method  of 
Sanger  et  al .  ( 16 ) .  Restriction  fragments  were  cloned  into 
Ml3mpl8  and  Ml3mpl9.  Primers  for  the  sequencing  reaction 
included  the  universal  primer,  oligonucleotides  1  and  2  in  Table 
1,  and  lambda  gtll  primers. 

Computer  analysis.  The  Protein  Identification  Resource, 
National  Biomedical  Research  Foundation,  Georgetown  University 
Medical  Center,  3900  Reservoir  Road,  N.W.,  Washington,  D.C. 
20007,  contains  computer  programs  which  allowed  us  to  store  and 
align  our  DNA  sequence  files  and  to  compare  the  DNA  sequences  of 
our  clones  to  those  in  the  computer  data  bank. 

Construction  of  bacterial  expression  vector.  The  plasmid 
expession  vector,  pGEM  3ZF-,  was  purchased  from  Promega,  Madison, 
WI.  We  chose  this  vector  because  it  could  be  used  to  achieve  two 
goals:  to  study  expression  of  cholinesterase  in  a  bacterial  host, 
and  to  produce  messenger  RNA  for  use  as  riboprobes  in  Northern 
blots . 

A  full-length  cholinesterase  cDNA  was  produced  by  ligating  a 
0.9-kilobase  (kb)  Pvu  II/Bam  HI  fragment  from  the  5'  end  of  clone 
Z13  to  a  2-kb  Bam  HI/Eco  RI  fragment  from  the  3'  end  of  clone 
Z19.  Figure  2  is  a  schematic  diagram  of  clones  Z13  and  Z19 
showing  the  location  of  restriction  sites  used  to  construct  the 
full-length  cDNA.  The  resulting  2.9-kb  fragment  was  ligated  into 
the  Sma  I  and  Eco  RI  sites  of  the  polylinker  region  of  pGEM  3ZF-. 
The  Pvu  II  end  of  the  cDNA  was  attached  to  the  Sma  I  site  of  the 
vector  by  blunt  end  ligation.  This  was  possible  because  both  Pvu 
II  and  Sma  I  yield  blunt  ends.  In  this  forced  orientation  the  29 
amino  acids  coding  for  the  SP6  promoter,  the  lac  Z  peptide,  and 
the  polylinker  are  on  the  5'  side  of  the  cholinesterase  cDNA. 
The  ligations  were  carried  out  simultaneously  so  that  Z13  and  Z19 
fragments  as  well  as  the  pGEM  vector  were  in  one  tube. 
Transformation  was  into  Escherichia  coli  strain  JM109  (17). 
Transformants  were  selected  by  ampicillTn  resistance.  Colonies 
containing  recombinant  plasmid  were  identified  by  absence  of 
beta-galactosidase  activity  which  caused  colonies  to  be  white 
rather  than  blue  in  the  presence  of  isopropyl-D-thiogalacto 
pyranoside  (IPTG)  and  5-bromo-4-chloro-3-indolyl-D-galactoside 
(X-gal).  Seven  white  colonies  resistant  to  ampicillin  were 
found.  Plasmid  DNA  was  prepared  from  all  seven  colonies  and 


subjected  to  restriction  mapping.  Only  two  of  the  seven  colonies 
had  the  correct  restriction  map  for  cholinesterase.  These  two 
recombinant  plasmids,  containing  2.9-kb  of  cholinesterase  cDNA 
and  3.2-kb  of  pGEM  DNA,  were  called  pBCHE  1  and  pBCHE  2.  All 
further  experiments  used  pBCHE  1  as  the  two  plasmids  were 
identical . 

Detection  of  expression  product.  pBCHE-transformed  cells 
were  grown  in  2  ml  of  LB  medium  containing  250  ug/ml  ampicillin, 
at  37°C,  overnight,  in  a  shaking  incubator  set  at  150  rpm.  An 
aliquot  of  the  overnight  culture  was  diluted  to  an  absorbance  at 
600  nm  of  0.05,  using  50  ml  of  LB  medium  containing  250  ug/ml 
ampicillin.  The  diluted  aliquot  was  grown  for  approximately  1  h 
and  15  min  to  an  absorbance  at  600  nm  of  1.0,  after  which  IPTG 
was  added  to  a  concentration  of  0.12  g  in  50  ml.  Incubation  was 
continued  for  another  1-  2  h.  The  IPTG  induced  the  lac  promoter 
to  make  cholinesterase  fusion  protein.  IPTG  had  to  be  added 
during  log  phase  growth  and  not  earlier  because  the 
cholinesterase  fusion  product  was  deleterious  to  growth.  Cells 
were  pelleted  by  centrifugation  of  the  50-ml  culture. 
Periplasmic  proteins  were  extracted  from  the  cell  pellet  with  2 
ml  of  25%  sucrose  containing  50  mM  Tris-Cl,  pH  8.0,  and  0.5  mg/ml 
lysozyme.  Proteins  remaining  after  another  centrifugation  were 
solubilized  in  1  ml  of  2%  SDS,  50  mM  Tris-Cl,  pH  7.0,  and  5% 
mercaptoethanol . 

Cell  lysate  was  tested  for  the  presence  of  cholinesterase  by 
1)  SDS-PAGE  stained  for  protein  with  Coomassie  blue,  2)  SDS-PAGE 
followed  by  electrotransfer  of  proteins  to  nitrocellulose, 
washing  with  cholinesterase  rabbit  antiserum,  washing  with 
horseradish  peroxidase-conjugated  goat  anti-rabbit  lg(H+L) 
antibody,  and  staining  with  hydrogen  peroxide  and 
4-chloro-l-naphthol .  The  human  cholinesterase  polyclonal 
antibody  was  purchased  from  Calbiochem,  San  Diego,  CA.  The 
horseradish  peroxidase-conjugated  goat  anti-rabbit  lg(H+L) 
antibody  was  from  Bio-Rad,  Richmond,  CA.  3)  A  third  method  for 
detecting  cholinesterase  was  nondenaturing  PAGE  stained  for 
esterase  activity  with  alpha-naphthyl  acetate  and  Fast  Red  TR  or 
Fast  Blue  RR  ( 18) . 

Riboprobe .  Radiolabeled  antisense  RNA  was  prepared  by 
cell-free  transcription  of  the  recombinant  plasmid  pBCHE  after 
pBCHE  had  been  linearized  with  Hind  III.  Cleavage  by  Hind  III 
was  at  the  5'  cDNA/plasmid  linker  junction.  RNA  synthesis  cannot 
proceed  beyond  the  cleavage  point,  thus  Hind  III  cleavage  defined 
the  end  of  the  transcript.  The  RNA  synthesis  reaction  in  a  total 
volume  of  20  ul  contained  40  mM  Tris-Cl,  pH  7.9,  6  mM  MgCl2/  2  mM 
spermidine,  0.5  mM  concentration  each  of  rATP ,  rUTP,  andz  rGTP , 
0.012  mM  rCTP,  10  mM  dithiothreitol ,  1  ,unit/ul  of  RNasin 
ribonuclease  inhibitor,  50  uCi  of  alpha-[JZP)  CTP  (Amersham, 
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Arlington  Heights,  IL,  410  Ci/mmol ) ,  0.2-1  ug  of  linearized 
plasmid  DNA,  and  10  units  of  T7  RNA  polymerase  (Promega,  Madison, 
WI )  (19).  After  1.5  h  reaction  at  37°C,  the  RNA  was  extracted 
with  phenol/chloroform  and  precipitated  with  sodium  acetate  and 
ethanol  in  the  presence  of  carrier  RNA. 

Northern  blot.  Poly  A(  +  )  RNA  from  human  liver  and  human 
brain  was  purchased  from  Clontech,  Palo  Alto,  CA.  The  RNA  was 
denatured  with  glyoxal  and  dimethylsulfoxide  (13)  and 
electrophoresed  on  1.0%  agarose.  To  maintain  constant  pH,  the 
buffer  (12  mM  Tris,  6  mM  sodium  acetate,  and  0.3  mM  EDTA )  was 
recirculated  during  electrophoresis.  After  electrophoresis  the 
gel  was  blotted  onto  Gene  Screen  Plus  Membrane  (Du  Pont  NEN, 
Wilmington,  DE).  Prehybridization  was  in  50%  formamide,  1%  SDS ,  1 
M  NaCl,  and  10%  dextran  sulfate  at  37°C  for  6  h.  Hybridization 
with  200  uCi  of  riboprobe  in  10  ml  of  fresh  solution  was  for  48  h 
at  37°C.  Posthybridization  washes  were  in  2  x  SSC  and  1%  SDS  at 
50°C  for  30  min,  0.1  X  SSC  at  25  C  for  30  min,  2  X  SSC  and  1%  SDS 
at  65°C  for  30  min,  and  0.1  X  SSC  at  50°C  for  30  min  (20). 

Screening  a  human  genomic  library.  A  genomic  library  was 
kindly  provided  By  Dr .  Patrick  Venta  of  the  University  of 
Michigan.  The  library  was  constructed  from  white  cell  DNA 
purified  from  the  blood  of  a  single  donor.  A  partial  Sau  3A 
digest  was  ligated  into  the  Bam  HI  site  of  the  polylinker  region 
of  the  vector  EMBL  3  (21).  Human  DNA  inserts  could  be  removed  by 
digestion  with  Sal  I.  The  average  size  of  the  inserts  was  15-20 
kb.  The  library  contains  1  million  independent  recombinants. 

The  library  was  plated  at  a  density  of  50,000  plaques/  150-mm 
plate  on  40  plates.  Screening  was  carried  out  by  the  ijri  situ 
hybridization  method  of  Benton  and  Davis  as  modified  by  Maniatis 
et  al.  (12,13).  After  baking  under  vacuum  at  80°C  for  2  h,  the 
nitrocellulose  filters  (Millipore,  Bedford,  MA  or  Schleicher  & 
Schuell,  Keene,  NH)  were  washed  at  37°C  in  3  X  SSC  and  0.1%  SDS 
for  6  h  and  then  prehybridized  at  68°C  in  6  X  SSC  and  0.25% 
nonfat  dry  milk  overnight.  Filters  were  hybridized  18  h  with  2-4 
ng/ml  of  probe  at  68°C  in  6  X  SSC  and  0.25%  nonfat  dry  milk. 
After  hybridization  the  filters  were  washed  in  2  x  SSC  and  0.1% 
SDS  at  68°C  for  4  h  with  three  wash  fluid  changes. 

Purification  of  DNA  from  EMBL  3  genomic  clones.  The  host  was 
E.  coli  strain  LE392.  A  high  titer  stock  of  the  plaque-purified 
cTone  wgs  prepared  by  incubating  100  ul  of  competent  LE392  cells 
with  10°  phage  (one  plaque).  The  phage  were  adsorbed  to  the 
LE392  cells  for  20  min  at  37°C.  After  adsorption  4  ml  of  LB  broth 
supplemented  with  10  mM  MgSO.  and  0.005%  thymidine  was  added. 
The  liquid  culture  was  incubated  at  37°C  with  vigorous  shaking 
until  lysis^l6-9  h)  occurred,  at  which  time  the  supernatant  had  a 
titer  of  lO-1  plaque-forming  units/ml.  Approximately  5  x  10° 
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plaques  were  used  to  infect  0.3  ml  of  a  fresh  overnight  culture 
of  LE392.  After  20-rain  incubation  to  allow  adsorption  of  the 
phage  to  the  bacterial  cells,  the  inoculum  was  added  to  500  ml  of 
NZYM  medium  and  grown  for  9-12  h.  NZYM  medium  contains  in  a 
total  volume  of  1  liter,  10  g  of  casein  (enzymatic  hydrolysate 
from  bovine  milk.  No.  C-0626  from  Sigma,  St.  Louis,  MO),  5  g  of 
NaCl,  5  g  of  Bacto-yeast  extract  (Difco  Laboratories,  Detroit, 
MI),  and  2  g  of  MgSO..  7H~0.  After  the  solids  have  dissolved  the 
pH  is  adjusted  to  7.5  with^  NaOH  (13).  DNA  was  prepared  from  the 
culture  supernatant  by  standard  procedures  (13). 

Genomic  blots.  Human  placental  DNA  (Sigma,  St.  Louis,  MO) 
was  diluted  to  50  ug/ml  for  digestion  with  various  restriction 
enzymes.  After  digestion  the  samples  were  extracted  with  phenol 
and  chloroform  and  then  precipitated  with  ethanol  and  glycogen 
(20  ug).  The  50  ug  of  digested  DNA  was  dissolved  in  50  ul  of  20 
mM  Tris-Cl,  pH  7.6,  and  0.1  mM  EDTA  buffer.  Gel  electrophoresis 
was  in  300  ml  of  1.2%  agarose  in  an  18  x  30  cm  gel  form,  at  40 
volts  for  24  h.  The  DNA  fragments  were  transferred  to 
nitrocellulose  by  the  method  of  Southern  as  described  by  Maniatis 
et  al.  (13).  Filters  were  hybridized  and  washed  at  68°C  as 
already  described. 
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RESULTS 


The  oligonucleotides  listed  in  Table  1  were  synthesized  to 
use  as  probes  for  human  cholinesterase  cDNA.  Our  laboratory  had 
previously  determined  the  complete  amino  acid  sequence  of  human 
serum  cholinesterase  by  Edman  degradation  of  overlapping  peptides 
(2).  The  probes  correspond  to  regions  of  the  amino  acid  sequence 
representing  minimum  codon  ambiguity.  Our  most  successful 
oligonucleotide  probe  proved  to  be  the  unique  sequence  53mer 
(probe  5)  because  it  could  be  used  at  the  highest  stringency. 
The  first  clone,  OH57,  was  found  by  using  probe  5  alone.  The 
53mer  probe  matched  the  cDNA  clone  in  50  of  53  nucleotides. 

Clone  Z3  in  Figure  1  was  found  by  screening  150,000  plaques 
of  the  human  brain  cDNA  library  after  making  four  nitrocellulose 
lifts  from  each  plate  (1).  One  filter  was  hybridized  with  the 
53mer  probe  (No.  5),  one  with  the  39mer  probe  (No.  6),  one  with 
a  mixture  of  probes  4,  8,  and  10,  and  one  filter  was  hybridized 
with  a  mixture  of  probes  3,  7,  and  9.  Clone  Z3  was  positive  on 
all  four  filters.  Clone  Z35  was  found  by  hybridizing  duplicate 
filters  with  two  different  probes,  probes  3  and  4.  Overlapping 
clones  which  completed  the  coding  sequence  at  the  5'  end  were 
found  by  using  fragments  of  clones  Z3  and  Z2  as  probes.  The 
130-bp  fragment  of  Z3  indicated  in  Figure  1  was  used  to  screen 
100,000  plaques  which  led  to  the  isolation  of  clone  Z2.  The 
270-bp  fragment  of  Z2  was  used  to  screen  2  million  plaques  which 
led  to  isolation  of  clone  Z13. 

The  location  of  one  intron  was  suggested  from  the  sequencing 
results  of  clone  Z35.  This  clone  had  105  extra  nucleotides  at  its 
5'  end  in  addition  to  the  614  nucleotides  indicated  in  Figure  1. 
The  extra  105  nucleotides  did  not  match  the  known  amino  acid 
sequence  and  were  absent  in  clone  Z3  as  determined  by  DNA 
sequencing.  The  420  bp  of  Z35  immediately  downstream  from  these 
105  nucleotides  matched  clone  Z3  in  all  but  two  nucleotides.  The 
junction  between  the  105  nucleotides  and  the  rest  of  clone  Z35 
contained  the  sequence  TTCCATATTTTACAGG,  which  is  similar  to  the 
consensus  sequence  present  at  the  3'  splice  junction  of  an  intron 
(22).  Thus,  it  appears  that  the  105  nucleotides  belong  to  an 
intron,  and  that  this  intron  is  located  between  nucleotides  1600 
and  1601  in  Figure  3.  We  considered  the  possibility  that  these 
105  extra  nucleotides  represented  alternative  splicing,  leading 
to  a  cholinesterase  with  an  extra  or  different  peptide  near  its 
carboxyl  terminus.  However,  there  were  five  stop  codons  within 
the  105  nucleotides,  a  result  compatible  with  an  intron  but  not 
with  a  coding  region.  It  is  not  uncommon  to  find  unspliced 
introns  or  possible  cloning  artifacts  in  cDNA  clones  (23,24).  We 
have  not  ruled  out  the  interpretation  that  these  105  nucleotides 
may  be  a  cloning  artifact  rather  than  an  intron. 


-15- 


The  nucleotide  sequence  is  shown  in  Figure  3.  The 
cholinesterase  found  in  human  serum  corresponds  to  nucleotides  1 
through  1722  and  contains  574  amino  acids.  The  amino  acid 
sequence  derived  from  the  nucleotide  sequence  of  the  brain  cDNA 
is  a  perfect  match  with  the  amino  acid  sequence  of  cholinesterase 
found  in  human  serum.  This  confirms  the  amino  acid  sequence 
determined  by  Edman  degradation,  including  those  regions  where  an 
overlap  had  not  been  obtained  but  depended  on  homology  with 
Torpedo  acetylcholinesterase.  Of  special  interest  was  the 
finding  that  asparagine  occurred  at  residues  17,  57,  106,  241, 
256,  341,  455,  461,  and  486.  We  had  reported  that  these  nine 
asparagines  were  glycosylated  (2).  Since  asparagine  cannot  be 
detected  by  amino  acid  composition  analysis  due  to  hydrolysis  of 
the  amide  by  HCl,  and  since  glycosylated  asparagine  is  not 
detected  as  a  phenyl thiohydantoin-amino  acid,  the  identification 
of  asparagine  had  been  based  on  indirect  evidence.  The 
nucleotide  sequence  shown  here  supports  the  conclusion  that  these 
residues  are  asparagines  and  that  they  are  linked  to  carbohydrate 
chains . 

Southern  blots  of  human  placental  DNA  that  had  been  digested 
with  various  restriction  enzymes  yielded  the  results  shown  in 
Figure  4.  The  enzymes  and  the  DNA  fragment  sizes  were  Eco  Ri 
(2.45  kb).  Hind  III  (4.95  kb).  Bam  HI  (no  fragment),  Kpn  I  (no 
fragment),  Pst  I  (14  kb),  Eco  RI  and  Bam  HI  (1.9  kb),  Bam  HI  and 
Hind  III  (3.1  and  1.85  kb),  Eco  RI  and  Hind  III  (2.45  kb),  Eco  RI 
and  Xba  I  (1.9  kb),  and  Eco  RI  and  Pst  I  (2.45  kb).  Other  bands 
and  spots  that  appear  in  Figure  4  but  are  not  listed  above,  were 
irreproducible  and  therefore  are  considered  to  be  artifacts.  The 
simple  pattern  of  bands  in  Figure  4,  where  only  one  or  two  bands 
in  each  lane  hybridizes,  indicates  that  there  may  be  a  single 
copy  of  the  cholinesterase  gene  or  very  few  copies.  A  more 
complex  pattern  would  be  expected  for  multiple  genes  because 
intron  sequences  would  not  be  conserved  even  if  exons  were  highly 
conserved. 

Northern  blots  using  10  ug  poly  A(+)  mRNA/lane  from  human 
liver  and  human  brain  were  hybridized  with  a  2.8-kb  riboprobe. 
See  Figure  5.  The  result  was  that  the  ribosomal  RNA  gave  the  most 
intense  bands.  Ribosomal  RNA  is  identified  by  its  size  of  1.85 
and  4.4  kb  and  is  referred  to  as  18  S  and  28  S  RNA.  In  addition 
to  these  intense  ribosomal  RNA  bands,  there  were  six  faint  bands 
whose  sizes  were  1.0,  1.45,  2.45,  2.8,  3.4,  and  4.2  kb.  The 
banding  pattern  was  exactly  the  same  for  human  liver  mRNA  and 
human  brain  mRNA.  We  consider  these  results  tentative  because 
the  riboprobe  hybridized  with  the  ribosomal  RNA  and  was  therefore 
not  completely  specific. 

Expression  in  bacteria.  Bacterial  cells  transfected  with 
pBCHE  expressed  cholinesterase  protein  as  judged  by 
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cross-reaction  with  human  cholinesterase  antibodies.  The  size  of 
the  immunoreactive  protein  was  65,000  on  SDS  gel 
electrophoresis.  See  Figure  6.  This  size  is  consistent  with  a 
cholinesterase  subunit  from  which  the  28  amino  acids  of  the 
signal  peptide  and  the  29  amino  acids  of  the  vector  have  been 
cleaved  off.  The  size  of  65,000  daltons  is  also  consistent  with 
a  subunit  free  of  carbohydrates.  The  fusion  protein  would  have 
had  a  molecular  weight  of  75,000  per  subunit. 

The  expression  product  was  tested  for  cholinesterase  activity 
by  electrophoresing  extracts  on  nondenaturing  polyacrylamide  gel 
and  staining  for  activity  with  alpha-naphthyl  acetate.  No 
cholinesterase  activity  was  detected.  Two  bacterial  esterases 
were  present  in  all  samples,  even  in  JM109  cells  containing  no 
recombinant  plasmid,  and  these  two  esterases  migrated  much  faster 
than  cholinesterase.  The  yield  of  cholinesterase  protein  was 
estimated  to  be  3  ug/  50-ml  culture,  from  gels  stained  with 
Coomassie  blue. 

Isolation  of  genomic  clone.  One  positive  clone,  P117N,  was 
found  by  screening  the  genomic  library  with  probe  OH5"’  Figure  7 
shows  a  partial  restriction  map  of  clone  P117N.  Its  total  length 
is  19  kb.  Clone  P117N  contains  the  first  two  exons  of 
cholinesterase  but  does  not  contain  exons  coding  for  amino  acids 
479-574  and  the  3'  untranslated  region.  The  position  of  the  two 
exons  was  determined  by  restriction  mapping  and  hybridization. 
To  establish  the  exact  length  of  exon  2  and  to  confirm  that  exon 
2  was  uninterrupted,  all  of  exon  2  was  sequenced.  The 
intron/exon  junctions  for  exon  2  were  also  sequenced.  We  have 
not  yet  completed  sequencing  the  intron/exon  junctions  of  exon  1. 

Figure  8  shows  the  nucleotide  sequences  at  the  intron/exon 
boundaries  as  we  know  them  to  date.  Intron  1  is  located  between 
nucleotides  -93  and  -92  and  has  a  length  of  4.0  kb.  Intron  2  is 
located  between  nucleotides  1433  and  1434  and  has  a  length  of  8.5 
kb.  Intron  3  is  located  between  nucleotides  1600  and  1601  and 
has  a  length  of  2.3  kb.  The  sizes  of  introns  2  and  3  are  based 
on  restriction  mapping  of  total  genomic  DNA  and  are  tentative. 
We  expect  to  get  more  definitive  sizes  for  these  introns  once  we 
have  isolated  a  genomic  clone  containing  exons  3  and  4.  The 
intron/exon  boundary  information  for  exon  4  is  from  the 
unprocessed  cDNA  clone  Z35. 

DNA  sequencing  of  clone  P117N  has  added  67  nucleotides  to  the 
5'  end  of  the  known  cDNA  sequence.  It  is  likely  that  these  67 
nucleotides  belong  to  exon  1  because  they  do  not  contain  the  TATA 
box,  which  is  expected  to  precede  exon  1  by  20-30  nucleotides. 
Figure  9  shows  all  of  the  nucleotide  sequence  information  we  have 
to  date  for  the  cholinesterase  gene  including  the  partial 
sequences  of  introns.  Figure  10  shows  a  tentative  map  of  the 
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cholinesterase  gene  and  suggests  that  the  total  length  of  the 
gene  is  17  kb  or  greater. 


I 
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DISCUSSION 


The  coding  region  of  the  cDNA  clone  isolated  from  human  brain 
is  a  perfect  match  with  the  amino  acid  sequence  of  human  serum 
cholinesterase  (2).  Therefore,  we  conclude  that  the  amino  acid 
sequence  of  cholinesterase  from  two  different  human  tissues,  from 
brain  and  serum,  is  identical.  The  cholinesterase  present  in 
human  serum  is  thought  to  be  synthesized  in  the  liver  (25,26). 
Prody  et  al.  (27)  isolated  cholinesterase  cDNA  from  a  human 
liver  library,  which  demonstrates  that  human  liver  has  mRNA  for 
the  same  protein  found  in  serum.  The  cholinesterase  in  serum  is 
the  soluble,  globular,  tetrameric  G4  form.  Human  fetal  and  adult 
brain  contain  G4  and  G1  forms  of  cholinesterase,  with  G4  being 
the  most  abundant  (28,29).  It  is  not  known  whether  G4  and  Gl  are 
coded  by  the  same  or  different  mRNAs.  Our  cDNA  represents  the 
soluble  G4  form  of  cholinesterase. 

To  date  the  evidence  leads  us  to  expect  that  cholinesterase 
and  acetylcholinesterase  will  be  shown  to  have  different  primary 
amino  acid  sequences.  The  amino  terminus  of  acetylcholinesterase 
from  human  red  blood  cells,  Glu-Gly-Ala-Glu-Asp-Ala  (30),  is 
different  from  the  Glu-Asp-Asp-Ile-Ile-Ile  for  human  serum 
cholinesterase.  The  18  amino  acid  sequence  represented  by  our 
53mer  probe  is  highly  conserved  in  human  cholinesterase  and  in 
Torpedo  and  Drosophila  acetylcholinesterases  (2,8,24,31).  This 
suggests  that  these  same  18  amino  acids  are  likely  to  be 
conserved  in  human  acetylcholinesterase.  Acetylcholinesterase  is 
more  abundant  than  cholinesterase  in  human  brain,  regardless  of 
developmental  stage  (28,29).  For  these  reasons,  we  expected  to 
find  acetylcholinesterase  as  well  as  cholinesterase  cDNA  using 
our  53mer  oligonucleotide  as  a  probe.  However,  we  have  found  no 
acetylcholinesterase  candidate  clones  that  differ  from  the 
sequence  in  Figure  3. 

Two  possible  genetic  variations  were  found  in  clone  Z35. 
Nucleotide  1615  was  A  in  clone  Z35,  but  was  G  in  clone  Z3. 
Nucleotide  1914  was  G  in  Z35,  but  was  A  in  Z3.  Figure  3  shows  G 
at  nucleotide  1615,  coding  for  Ala  539,  because  amino  acid 
sequencing  had  shown  Ala  539.  An  A  at  nucleotide  1615  would  code 
for  Thr.  A  rare  variant  would  not  have  been  detected  by  amino 
acid  sequencing  because  the  cholinesterase  protein  had  been 
purified  from  pooled  human  serum.  The  cDNA  library  was  from  mRNA 
of  a  single  person.  If  the  donor  was  heterozygous,  the  cDNA 
library  would  contain  an  equal  number  of  clones  representing  the 
rare  and  normal  variants.  The  possibility  that  these  nucleotide 
differences  are  cloning  artifacts  must  also  be  considered. 

Upstream  of  the  beginning  of  the  mature  protein,  in  the 
signal  peptide  region,  are  212  nucleotides  coding  for  69  amino 
acids.  There  are  four  ATG  triplets  in  the  same  reading  frame  as 
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the  protein  and  one  ATG  triplet  in  a  different  reading  frame.  In 
95%  of  all  cases  the  ATG  triplet  nearest  the  5'  end  of  the  mRNA 
is  the  initiator  signal  (32).  This  would  suggest  that  ATG  at 
position  -207,  corresponding  to  Met  -69,  could  be  the  codon  which 
initiates  translation.  However,  this  ATG  does  not  lie  in  a 
consensus  sequence  for  initiator  signals.  Only  the  ATG  coding 
for  Met  -28  lies  within  a  favorable  consensus  sequence,  AATATGC, 
commonly  found  in  functional  initiators  (32).  Another  reason 
favoring  Met  -28  as  the  functional  initiator  is  that  signal 
peptides  are  known  to  range  in  length  from  15  to  36  amino  acids 
(33).  A  signal  peptide  with  69  amino  acids  would  be  unusually 
long.  It  is  possible  that  synthesis  of  cholinesterase  is 
initiated  at  several  sites,  and  that  this  has  a  role  in 
regulation  of  expression  (34).  Our  signal  peptide  has  the  three 
components  characteristic  of  all  signal  peptides  (33).  Residues 
-1  to  -5  are  the  c  region  defining  the  cleavage  site  for  signal 
peptidase,  residues  -6  to  -17  are  the  hydrophobic  h  region,  and 
residues  -18  to  either  Met  -28,  Met  -47,  or  Met  -69  are  the  n 
region  containing  a  net  positive  charge. 

The  cDNAs  for  Drosophila  and  Torpedo  marmorata 
acetylcholinesterase  also  have  5'  sequences  with  multiple  ATG 
sites  (24,31).  In  contrast  to  human  cholinesterase  cDNA,  most  of 
these  correpond  to  very  brief  coding  regions  that  are  terminated 
by  stop  codons. 

In  the  3'  untranslated  region,  a  polyadenylation  signal, 
AATAAA,  at  nucleotides  2002-2007,  is  18  nucleotides  away  from  a 
poly (A)  addition  site,  CA,  at  2025.  This  agrees  with  the  10-30 
nucleotide  separation  commonly  found  for  these  sites  (35).  Clone 
Z3  ends  at  nucleotide  2027,  suggesting  that  the  mRNA  for  clone  Z3 
had  a  poly(A)  tail  following  nucleotide  2026  and  suggesting  that 
the  poly(A)  signal  at  nucleotide  2002  was  functional  for  clone 
Z3.  Cleavage  of  pre-mRNA  prior  to  polyadenylation  can  require  the 
presence  of  sequences  rich  in  G/T  about  30  bp  downstream  from  the 
CA  poly (A)  site  (35,36).  Figure  3  shows  that  GTG,  TTTT,  and  TGT 
follow  nucleotide  2025  within  approximately  30  bases.  This  gives 
additional  support  to  the  possibility  that  transcription 
termination  for  the  mRNA  of  clone  Z3  was  directed  by  the  AATAAA 
signal  at  nucleotides  2002-2007. 

Clone  Z35  terminates  at  the  oligo(A)  stretch  shown  in  Figure 
3,  at  nucleotide  2214.  Clone  Z35  may  have  utilized  a  nonconsensus 
poly(A)  addition  signal,  probably  ATTAAA  (35),  at  nucleotides 
2182-2187.  This  would  suggest  termination  at  alternative  poly(A) 
sites  and  generation  of  mRNAs  of  different  lengths  but  identical 
coding  potential. 

Northern  blot  analysis  of  mRNA  from  human  liver  and  human 
brain  showed  six  sizes  for  mRNA,  1.0,  1.45,  2.45,  2.8,  3.4,  and 
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4.2  kb'.  One  of  these,  the  2.45-kb  mRNA,  fits  the  cDNA  reported 
in  Figure  2.  The  1.0-  and  1.45-kb  mRNA' s  are  too  short  to  code 
for  the  574  amino  acids  of  the  cholinesterase  subunit  and  their 
significance  is  unclear.  The  2.8-,  3.4-  and  4.2-kb  mRNA' s  could 
possibly  represent  termination  at  alternative  poly(A)  sites. 
Prody  et  al.  (27)  reported  a  single  mRNA  of  2.5  kb  in  human 
liver  and  human  brain.  However,  multiple  mRNA' s  were  found  in 
Torpedo  californica  and  T.  marmorata  electric  organ,  as  well  as 
in  Drosophila.  The  sTzes  ranged  from  2.2  to  20  kb  in  T. 
californica  T8)  and  were  5.5,  10.5,  and  14.5  in  T.  marmorata 
[24 ).  In  Drosophila  melanogaster  pupae  two  mRNAs  of-  4.5  and  4.8 
kb  were  reported,  as  well  as  fainter  bands  both  above  and  below 
these  sizes  (31). 

The  amino  acid  sequence  of  human  cholinesterase  is  54% 
identical  with  the  amino  acid  sequence  of  acetylcholinesterase 
from  the  electric  organ  of  T^_  californica  (2,8).  The  homology 
extends  throughout  the  two  proteins,  particularly  in  the  region 
of  amino  acids  430-450.  The  sequences  of  human  cholinesterase  and 
D.  melanogaster  acetylcholinesterase  are  38%  identical  (10,31). 
Despite  tne  presence  of  an  active  site  serine  in  the 
cholinesterases,  the  cholinesterases  do  not  have  significant 
sequence  homology  with  the  serine  proteases.  It  appears  that  the 
cholinesterases  are  a  specialized  type  of  serine  esterase  (10), 
differing  from  serine  proteases,  which  include  the  blood 
coagulation  factors  and  other  hydrolytic  enzymes.  To  date  the 
only  protein  outside  of  the  cholinesterase  family  which  is  known 
to  have  significant  sequence  homology  with  the  cholinesterases  is 
thyroglobulin  (2,8). 

Expression  in  bacteria.  When  a  recombinant  plasmid 
containing  full-length  cDNA  for  cholinesterase  was  transfected 
into  bacteria,  the  bacteria  synthesized  cholinesterase.  The  size 
of  the  bacterial  product  was  65,000  daltons  which  is  consistent 
with  a  cholinesterase  subunit  free  of  carbohydrates  and  processed 
by  removal  of  the  signal  peptide.  The  bacterial  product  had  no 
cholinesterase  activity.  This  result  is  not  surprising  as 
bacterial  systems  often  produce  inactive  mammalian  enzymes, 
especially  when  the  mammalian  enzyme  is  a  complex  glycoprotein 
with  many  disulfide  bonds.  The  bacteria  seem  unable  to  form 
correct  disulfide  bonds.  Human  cholinesterase  has  three  internal 
disulfide  bonds  per  subunit  and  one  interchain  disulfide  bond  for 
a  total  of  14  disulfide  bonds  per  molecule  (10).  The  importance 
of  the  carbohydrate  chains  is  unknown,  and  a  protein  made  in 
bacterial  cells  will  not  be  glycosylated.  A  mammalian  expression 
system  is  more  likely  to  yield  active  cholinesterase  enzyme 
(37)  . 

Cholinesterase  gene.  Results  for  the  cholinesterase  gene  are 
incomplete .  We  have  isolated  one  genomic  clone  and  this  clone 
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contains  exons  1  and  2.  Exon  2  is  unusually  long,  containing  1525 
nucleotides.  Exon  2  codes  £or  83%  of  the  mature  cholinesterase 
protein.  Introns  are  located  between  nucleotides  -93  and  -92  in 
the  5'  untranslated  region  and  between  nucleotides  1433  and  1434, 
and  1600  and  1601  in  the  coding  region.  The  gene  contains  a 
total  of  at  least  four  exons  and  three  introns.  A  fifth  exon  is 
expected  based  on  results  from  the  laboratory  of  J.  Massoulie  for 
Torpedo  acetylcholinesterase  (38).  They  found  that  the 
hydrophobic  globular  form  of  acetylcholinesterase  results  from 
alternative  splicing  of  an  exon  coding  for  the  carboxyl 
terminus.  The  alternatively  spliced  exon  must  be  a  fifth  exon. 
We  have  no  information  at  present  regarding  a  fifth  exon  in 
cholinesterase. 

Since  cholinesterase  and  thyroglobulin  have  extensive  amino 
acid  sequence  homology,  it  is  of  interest  to  compare  the  location 
of  introns.  Thyroglobulin  has  2,748  amino  acids  (39)  which  is 
about  5  times  more  than  the  574  amino  acids  of  the  cholinesterase 
subunit.  The  sequence  homology  is  only  with  540  amino  acids  at 
the  carboxyl  terminus  of  thyroglobulin  and,  therefore,  the 
comparison  of  intron  locations  is  limited  to  this  region.  The 
most  striking  result  is  that  thyroglobulin  has  at  least  nine 
introns  within  this  region,  with  one  intron  occurring 
approximately  every  150  nucleotides  (40,41).  In  contrast, 
cholinesterase  has  only  two  introns  within  the  coding  region. 
These  two  cholinesterase  introns  are  located  in  precisely  the 
same  positions  as  the  last  two  introns  of  thyroglobulin.  By 
comparison  with  thyroglobulin  it  seems  that  seven  introns  have 
been  lost  from  the  cholinesterase  gene.  The  seven  fewer  introns 
in  the  cholinesterase  gene  account  for  the  relatively  small  size 
of  the  cholinesterase  gene  whose  minimum  size  is  17  kb.  In 
contrast,  the  size  of  the  thyroglobulin  gene  is  76  kb  for  the 
region  that  corresponds  to  cholinesterase  (40)  and  300  kb  for  the 
whole  thyroglobulin  gene  (42). 
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Fig.  1.  Restriction  man  and  sequencing  strategy  forcholinester- 
ase  cDNA.  Scale  at  too  snows  length  in  kitobases  (kb).  The  open  bar 
represents  the  1722  nucleotides  that  code  for  the  574  amino  acids  of 
mature  cholinesterase.  The  hatched  bar  represents  84  nucleotides  of 
the  signal-peptide  region.  The  solid  bar  at  the  3'  end  represents  492 
nucleotides  of  the  3'  untranslated  sequence.  Restriction  sites  used  for 
sequencing  are  indicated  by  E  (EcoRl),  S  (Sau3Al),  B  (Bam HI),  and 
P {Ssp  I).  Lines  immediately  below  the  restriction  map  show  the  sizes 
and  designations  for  five  cDNA  inserts  in  Xgtll.  The  solid  boxes  are 
oligonucleotide  probes  53, 29,  and  39  nucleotides  in  length.  They  are 
shown  on  the  clones  they  identified.  The  open  boxes  are  cDNA 
probes,  130  and  270  bp  in  length,  prepared  from  clones  Z3  and  Z2, 
respectively.  Lines  with  arrowheads  show  the  direction  and  extent 
of  DNA  sequencing. 
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Fig.  2.  Schematic  diagram  of  clones  used  to  construct 
full-length  cholinesterase  cDNA.  The  Pvu  I I/Bam  HI  fragment  from 
cDNA  clone  Zl3  was  ligated  to  the  Bam  HI/Eco  HI  fragment  of  cDNA 
clone  Z19  to  yield  full-length  cholinesterase  cDNA.  The  Pvu  II 
and  Bam  HI  restriction  sites  are  genuine  sites  in  the 
cholinesterase  gene,  but  the  Eco  Rl  site  at  nucleotide  2650  is 
from  an  Eco  Rl  linker  attached  during  construction  of  the  cDNA 
library.  In  the  diagram  for  full-length  cDNA  the  open  box  from 
nucleotides  1  to  1722  represents  the  complete  coding  region  of 
mature  cholinesterase.  The  filled-in  box  starting  at  the  Pvu  II 
site  is  the  5'  untranslated  region  and  includes  84  nucleotides  of 
the  signal  peptide.  The  filled-in  box  from  nucleotide  1722  to 
the  Eco  Rl  site  is  the  3'  untranslated  region. 
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Fig.  3.  Nucleotide  and  amino  acid  sequence  of  human 
cholinesterase . 
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Thr 

Gly 

Asn 

I  le 

Asp 

Glu 

A  la 

Glu 

T  rp 

Glu 

Trp 

Lys 

Ala 

Gly 

Phe 

His 

Arg 

549 

TTG 

GAA 

ATG 

ACA 

GGA 

AAT 

ATT 

GAT 

GAA 

GCA 

GAA 

TGG 

GAG 

TGG 

AAA 

GCA 

GGA 

TTC 

CAT 

CGC 

1647 

Trp 

Asn 

Asn 

Tyr 

Met 

Met 

Asp 

Trp 

Lys 

Asn 

Gin 

Phe 

Asn 

Asp 

T  yr 

Thr 

Ser 

Lys 

Lys 

Glu 

569 

TGG 

AAC 

A  AT 

TAC 

ATG 

ATG 

GAC 

TGG 

AAA 

AAT 

CAA 

TTT 

AAC 

GAT 

TAC 

ACT 

AGC 

AAG 

AAA 

GAA 

1707 

Ser  Cys  Val  Gly  Leu  ***  574 

AGT  TGT  GTG  GGT  CTC  TAA  TT A AT AGAT T T ACCC TT T AT AGAAC AT A T TTTCCTT T AGA TCAAGGCA A A A A T A T  1780 

CAGGAGCTTTTTTACACACCTACTAAAAAAGTTATTATGTAGCTGAAACAAAAATGCCAGAAGGATAATATTGATTCCT  1859 

CACATCTTTAACTTAGTATTTTACCTAGCATTTCAAAACCCAAATGGCTAGAACATGTTTAATTAAATTTCACAATATA  1938 

A AGTTCTACAGTTA ATTA  TGTGCATA  TTA A A ACA A  TGGCCTGGTTCA ATTTCTTTC  TT  TCCTTaa  taanTT  T  AAGTT  TT  2017 
TTCCCCCCAA A ATTATC AGT  GCTCTGCTTTT  AGTC ACGTGT  AT  T  TTC ATT  ACC ACTCGT  AA A A AGGT  ATCT  T  T  TTTA A A  2096 
TGAATTAAATATTGAAACACTGTACACCATAGTTTACAATATTATGTTTCCTAATTAAAATAAGAATTGAATGTCAATA  2175 
TGAGATATTAAAATAAGCACAGAAAATCAAAAAAAAAAA  2214 

Fig.  3.  Nucleotide  and  amino  acid  sequence  of  human 
cholinesterase .  Nucleotide  I  is  the  G  of  the  GAA  codon  for  the 
amino  terminal  amino  acid  (Glu,  amino  acid  1).  The  active  site 
serine  is  Ser-198.  The  underlined  ATG's  are  possible  translation 
initiation  sites.  The  polyadenylation  signal  (aataaa)  is  shown 
in  lowercase  letters  in  the  3'  untranslated  region.  The 
nucleotide  locations  of  clones  used  to  construct  this  sequence 
are  as  follows:  Z13,  -212-1066;  Z2,  400-1381;  Z3,  583-2027;  OH57, 
625-1341;  and  Z35,  1600-2214.  Nucleotide  1615  (Ala  539)  was  G  in 
clone  Z3  but  was  A  in  clone  Z35.  Nucleotide  1914  was  A  in  clone 
Z3  but  was  G  in  clone  Z35. 
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cDNA  clone  OH57.  Hach  lane  contained  25  fig  of  human  placental 
genomic  ONA  digested  with  EcoRI  (E),  BamHl  (B),  Hiriilll  (H), 
Kpn  I  (K).  Pst  I  (P),  or  combinations  of  these  (EX,  EcoRI  plus  Xba 
I).  Electrophoresis  in  1.2%  agarose  was  followed  by  DNA  transfer  to 
nitrocellulose  filter.  Hybridization  was  at  68°C  for  24  hr  with  probe 
(4  ng/ml)  in  6x  SSC/O.25%  dry  milk.  The  blots  were  washed  in  2x 
SSC/0.1%  NaDodSO*  at  68°C  (3  times.  1  hr  each)  and  exposed  for 


20  hr  with  intensifying  screen. 


a  b  c  d  e 


* 


Fig.  6.  Detection  of  cholinesterase  expression  on  Western 
blot.  Samples  were  first  subjected  to  SDS-PAGE,  then  transferred 
to  nitrocellulose  membrane  and  hybridized  with  cholinesterase 
antibody.  Lane  a  contains  highly  purified  human  serum 
cholinesterase.  Lane  b  contains  human  albumin.  Lanes  c,  d,  and 
e  are  bacterial  cell  lysates.  Lane  c  is  the  cell  lysate  from  a 
control  experiment  in  which  JM109  cells  were  transfected  with 
plasmid  vector  devoid  of  human  DNA.  Lane  d  is  the  cell  lysate 
from  JM109  cells  transfected  with  pBCHE  and  induced  for  one  h 
with  IPTG .  Lane  e  is  the  same  as  d  except  that  no  IPTG  inducer 
was  added.  Cholinesterase  antibodies  recognized  several  bands  in 
lane  d;  the  band  at  65  kdaltons  may  be  intact  cholinesterase 
protein,  while  the  lower  molecular  weight  bands  may  be 
proteolytic  degradation  fragments. 
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Fig.  7.  Restriction  map  of  genomic  clone  P117N.  The  heavy 
bars  are  exons  1  and  2.  Restriction  sites  are  Eco  RI  (E),  Pvu 
II  (P),  Hind  III  (H),  Bam  HI  (B),  and  Sal  I  (S).  Sal  I  marks 
the  boundary  of  the  human  DNA  and  the  phage  vector.  There  are 
additional  Eco  RI ,  Pvu  II,  and  Hind  III  sites  within  the  clone 
that  have  not  yet  been  mapped. 


-31- 


exon  1:  intron  1  :exon  2 

AGAGAATCGG:  ? . cttacctctctttcag  :  AAATCAATA 

-93  -92 


exon  2:  intron  2  :exon  3 

CAAAATATGG :  gtaagtgctga . ?  :  GAATCCAAA 

1433  1434 

Gly ( 478 )  Gly ( 478 ) 


exon  3:  intron  3  :exon  4 

GAAATGACAG :  ? . ttccatattttacag  rGAAATATTG 

1600  1601 

Gly ( 534 )  Gly ( 534 ) 


C  a  tttttttttttt  c 

AG:gta  gt .  n  ag:G 

A  g  cccccccccccc  t 

Consensus  sequence  from  Ref.  22. 


Fig.  8.  Intron/exon  boundaries  of  cholinesterase.  Exon 
sequences  are  In  capital  letters .  intron  sequences  are  in 
lowercase  letters.  Numbers  below  the  DNA  sequence  are  the 
nucleotide  numbers.  The  boundary  for  intron  3  is  from  sequencing 
an  unprocessed  cDNA  clone. 
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Exonl 

ATTTCAAGT1GCTGCTGCCAACTCTCGCGA3CTITGTCA3TAACAGTTGATTGTTACATTCAGTAACACTGA 


Pvu  II 

ATGTCAGTGCAGTCCAATrTACAGGCTGGAGCAGCAGCTGCATCCTGCATTTCCCCGAAGTAT,EACATGATT 

exon  lrintron  1 

TTCACTCCTTGCAAACTTTACCATCTTrGTTGCAGAGAATCGG . tttacagaatcaggtttatt 

tgtttcataatttgtagaattatcaagcatcatattttaggtaattatcatcaataaagtataatatgctat 

-28 

intron  l:exon  2  MetHisSerLysValThrlle 
atgcagaaggcttataaaacattatacttacctctctttcag:AAATCAATATGCATAGCAAAGTCACAATC 

-1+1 

I  leCysI  leArgPheLeuPheTrpPheLeuLeuLeuCysMe  tLeuI  leGlyLysSe  rHi  sltirGluAspAsp 

ATATCCATCAGATTIUICTTrTCGTITCTTTTGCTCTGCATGCTIATrcGGAAGTCAf^TACTGAAGATGAC 

IlellelleAlaThrLysAsnGlyLysValArgGlyMetAsnLeuThrValPheGlyGly'IhrValThrAla 
ATCATAATTGCAACAAAG^TGGAAAAGTCAGAGGGATGftACTTGACAGTTITTGGTGGCACGGTAACAGCC 
Eco  RI  TaqI  50 

PheLeuGlylleProTyrAlaGlnProProLeuGlyArgLeuArgPheLysLysProGlnSerLeuThrLys 
lTrCTTGGftAlTCCCTATGCACAGCCACCTCTTGGTAGACTICGATTCAAAAAG(XACAGTCTCTGACCAAG 

TrpSerAspIleTrpAsnAlalhrLysTyrAlaAsnSerCysCysGlnAsnlleAspGlnSerPheProGly 

TGGTCIGATATTTGGAATGCCACAAAATATGCAAATTCTTGCTGTCAGAACA'EAGATCAAAGTTTTCCAGGC 

PheHisGlySerGluMetTrpAsnProAsmtirAspLeuSerGluAspCysLeuTyrLeuAsnValTrpIle 

TTCCATGGATCAGAGATGTGGAACCCAAACACTGACCTCAGTGftAGACTGTTIATATCTAAATGTATGGATT 

100 

ProAlaProLysProLysAsnAlaThrValLeuIleTrplleTyrGlyGlyGlyPheGlnThrGlyTlirSer 

CCAGCACCTAAACCAAAAAATGCCACTGTATTGATATGGATTTATGGTGGTGGTTTTCAAACTGGAACATCA 

Se  rLeuHi  sValTyrAspGlyLysPheLeuAlaArgvalGluAr  gvall  leValValSe  rMetAsnTyrArg 
TCTTTACATGTTTATGATGGCAAGT'ITCTGGCTCGGGTrGAAAGftGTEATTGTAGTGTa^TGAACTATAGG 
150 

ValGlyAlaLeuGlyPheLeuAlaLeuProGlyAsnProGluAlaProGlyAsnMetGlyLeuPheAspGln 

GTGGGTGCCCTAGGATTCTTAGCTTTGCCAGGAAATCCIXjAGGCTCCAGGGA&CATCGGTTIATTTGATCAA 

GlnLeuAlaLeuGlnTrpValGlnLysAsnlleAlaAlaPheGlyGlyAsnProLysSerValThrLeuPhe 
(^GTroGCTCTTCAGTGGGTTCAAAAAAATATAGCAGCCTTTGGTGGftAATCCTAAAAGTGTAACTCTCTTT 
198  200 

GlyGluSERAlaGlyAlaAlaSe  rValSe  rLeuHi  sLeuLeuSe  r  ProGlySe  rHi  sSe  rLeuPheThrArg 

GGAGAAAGTGCAGGASCAGCTTCAGTTAGCCTGCATriGCTITCTCCTGGAAGCCATrCATTGITCACCAGA 
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Fig.  9.  Partial  nucleotide  sequence  of  the  cholinesterase  gene. 


Bam  HI 

Alai leLeuGlnSe rGlySe rPheAsnAlaProTrpAlaValThr Se rLeuTy rGluAlaAr gAsnArglhr 
GCCAITCTGCAAAGTGGATCCTTTAATGCTCCTrcGGCGGTAACATCTCTTTATGAAGCTAGGAACAGAACG 
255  Xba  I 

LeuAsnLeuAlaLysLeuThrGlyCysSerArgGluAsnGluThrGluIlelleLysCysLeuArgAsnLys 

TIGAACTTAGCTAAATTGACTGGTrGCTCTAGAGAGftATGAGACTGftAATAATQVAGTGTCTTAGAAATAAA 


AspProGlnGluIleLeuLeuAsnGluAlaPheValValProTyrGlyThrProLeuSerValAsnPheGly 

GATCCa^AGAAATTCTTCTGAATGAAGCATTIGTTCTCCCCTATGGGACTCCTITGTCAGTAAACTTTGGT 

300 

ProThrValAspGlyAspPheLeuThrAspMetProAspIleLeuLeuGluLeuGlyGlnPheLysLysThr 

CCGACCXJTGGATGGTGAITnXrrcACTGACATGCCAGACATATTACnTGAACTTGGACAATTTAAAAAAACC 

GlnlleLeuValGlyValAsnLysAspGluGlyThrAlaPheLeuValTyrGlyAlaProGlyPheSerLys 

CAGA'rTTTCGTGGG'IGTTAATAAAGATGAAGGGACAGCTT'rri'iAGTCTA'IGGTGCTCCTGGCTrCAGCAAA 

350 

AspAsnAsnSe  r  I  lei  leThrArgLysGluPheGlnGluGlyLeuLysIlePhePheProGlyvalSe  rGlu 
GATAACAATAGTATCATAACTAGAAAAGAATTTCAGGAAGGTTTAAAAATATTTTTTCCAGGAGTGAGTGAG 

PheGlyLysGluSerlleLeuPheHisTyrThrAspTrpValAspAspGlnArgProGluAsnTyrArgGlu 

T'lTCGAAAGGAArcCA'TCCTTTTTCATTACACAGACTGGGTAGATGATCAGAGACCTGAAAACTACCGTGAG 

400 

AlaLeuGlyAspValValGlyAspTyrAsnPhelleCysProAlaLeuGluPhe'IhrLysLysPheSerGlu 

GCCTTGGGTGATGTTCTTGGGGATTATAATTTCATATGCCCTGCCTTGGAGTTO^CAAGAAGTTCTCAGAA 

TrpGlyAsnAsnAlaPhePheTyrTyrPheGluHi  sArgSe  rSe  rLysLeuProTrpProGluTrpMetGly 
TGGGGAAATAATCCCTTTTTCTACTATTTTGAACACCGATCCTCCAAACTTCCGTGGCCAGAATGGATGGGA 

450 

ValMetHisGlyTyrGluIleGluPheValPheGlyLeuProLeuGluArgArgAspAsnTyrThrLysAla 

GTGATGCATGGCTATGAAATTG^TTTGTCTITGGTTIACCTCTGGftAAGAAGAGATAATIACACAAAAGCC 

exon  2:intron  2 

GluGlul leLeuSe rAr gSe  r I leValLy sAr gTrpAl aAsnPheAlaLysTy rGly 
GAGGAAATTTTGAGTAGATCCATAGTGAAACGTrGGGCAAATrTTGCAAAATATGG :  gtaagtgctgatttt 

intron  2: exon  3 


Sau3A  AsnProAsnGluThr 

ttgtcattgt  tctctggcttggt  ttgatc . GAATCCAAATGftGACT 


500 

GlnAsnAsnSerThrSerTrpProValPheLysSerThrGluGlnLysTyrLeu'IhrLeuAsnThrGluSer 

CAGftACAATAGCACAAGCIGGCCTGTCTTCAAAAGCACTGAACAAAAATATCTAACCTTGAATACAGAGTCA 

ItirArglleMet'IhrLysLeuArgAlaGlnGlnCysArgPheTrpThrSerPhePheProLysValLeuGlu 

ACAAGAATAATGACGAAACTACGTCCTCAACAATGTCGATTCTGGACATCATTITrTCCAAAAGTCTTGGAA 
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exon  3:intron  3 
MetThrGly 

ATGACAG . ctgtactgtgtagttagagaaaatgg 

cttttgtattcgaaattatttttcagttaatgaaacagataaaaattttgattaatacaacttattccatat 

intron3:exon4  550 

AsnlleAspGluAlaGluTrpGluTrpLysAlaGlyPheHisArgTrpAsnAsnTyrMetMet 

tttacagGAAATATTGATGAAGCAGftATGGGAGTGGAAAGCAGGATTCCATCGCTGGAACAATTACATGATG 

574 

AspT  rpLy  sAsnGl  nPheAsnAspTy  r  Th  r  Se  rLysLy  sGluSe  rCysValGlyLeu**  * 
GACTGGAAAAATCAATTIAACGAITACACEAGCAAGAAAGAAAGTTGTGTGGGTCTCTAATIAATAGATTTA 

CCCITIATAGAACATATTTTCCT'ITAGATCAAGGCAAAAATATCAGGAGCTrriTTACACACCTACTAAAAA 

AGTTATTATCTAGCTGAAACAAAAATCCCAGAAGGATAATATTCATTCCTCACATCTTIAACITAGTATTrT 

ACCTAGCATTTO^AAACCC^AATGGCIAGAAGVTGlTEAATIlAAATTrCACAATATAAAGTTCIACAGTTAA 

TTATGTGCATATTAAAACAATGGCCTGGTTCAATTKTITrcTITCC'ITaataaaTTIAAG'ITITrTCCCCCC 

AAAAl^TCAGTGCICTCCTTTIAGTCACGTGTATTTrcATTACCACIO^TAAAAAGGTATCTTrrTTAAAT 

GAATTAAATATlty^AACACTGTACACCATAGTTTACAATAT'rATGTTTCCTAATEAAAATAAGftATrGAATG 

TCAATATGAGATATTAAAATAAGCACAGAAAATCAAAAAAAAAAA 


Fig.  9.  Partial  nucleotide  sequence  of  the  cholinesterase  gene. 
The  first  67  nucleotides  were  obtained  from  a  genomic  clone  and  not  a 
cDNA  clone,  so  it  is  unknown  whether  they  belong  to  exon  1.  A  total 
of  four  exons  and  three  introns  is  identified  above.  Intron  sequences 
are  in  lowercase  letters.  The  mature  protein  begins  at  Glu  -*-1  and 
ends  at  Leu  574.  The  signal  peptide  starts  at  Met  -28.  The  active 
site  serine  is  Ser  198.  unique  Pvu  II,  Eco  RI,  Bam  HI,  and  Xba  I 
restriction  sites  are  underlined,  as  well  as  a  Taq  I  and  Sau  3A  site 
that  were  particularly  useful  for  mapping. 
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Fig.  10.  Tentative  restriction  map  of  the 
cholinesterase  gene.  The  five  exons  are  shown  as  solid 
rectangles .  Tne  Tengths  of  introns  2  and  3  are 
tentative.  The  sizes  are:  exon  1,  121  bp;  intron  1, 
4.0  kb;  exon  2,  1525  bp;  intron  2,  8.5  kb;  exon  3,  167 
bp;  intron  3,  2.3  kb;  exon  4,  604  bp.  Intron  4  and 
exon  5  are  speculative.  Restriction  sites  are  Pvu  II 
(P),  Hind  III  (H),  ECO  RI  (E),  Bam  HI  (B),  and  Taq  I 
(T)  . 
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